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Japanese Laid-Open Patent Application Hei 4 - 54638 
SPECIFICATION 

1. Title of the Invention: 

ELECTRONIC COMPUTER 

2. Scope of Patent Claims: 
Claim 1 

An electronic computer wherein, in an electronic computer provided with a fetch circuit 
that simultaneously fetches multiple instructions, and an instruction distributor that 
distributes & supplies the multiple instructions fetched by the fetch circuit, to multiple 
execution units, respectively, and that allows them to execute [the instructions] in parallel, 
it is provided with a 1 st processing unit that decodes the multiple instructions fetched by the 
fetch circuit, respectively, and that detects the execution order dependency between each of 
the instructions, parallel to the distribution & supply processing by the instruction 
distributor. 

Claim 2 

The electronic computer according to Claim 1 , wherein, it is provided with a 2 nd processing 
unit that controls to abort the instructions, which have been supplied by the [execution] 
units, in accordance with the execution order dependency detected between [each of] the 
multiple instructions by the 1 st processing unit. 



Japanese Laid-Open Patent Application Hei 4 - 54638 

3. Detailed Description of the Invention: 
[Objective of the Invention] 
(Industrial Application Field) 

The present invention relates to an electronic computer that executes multiple instructions 
in parallel. 

(Prior Art) 

In recent years, the development of parallel processing type electronic computers, where 
the execution of multiple instructions in the minutely-graded level in parallel results in 
speeding up of the operation processing speed, has been variously attempted. In particular, 
recently, parallel processing type electronic computers referred to as a VLIW system, 
typified by, for example, "TRACE" by Multiflow Computer, Inc. and "i860" by Intel 
Corporation, have been developed, and have attracted some notice. However, in this type 
of conventional system electronic computer, there is still an essential problem such that 
conventional general sequential instruction codes cannot be executed. 

Further, for the purpose of executing conventional sequential instruction codes in parallel 
while the compatibility of their object is maintained, it is necessary to test the sequentially 
provided possibility of the simultaneous execution of multiple instructions. However, for 
the purpose of conducting this test, it is necessary to increase, for example, the number of 
logic stages per cycle, as a result of which the cycle time must be increased. 

In the meantime, as a technique for analyzing the dependency between each of the multiple 
instructions for the purpose of supporting the execution of instruction(s) having latency, a 
register scoreboard method has been known. In this register scoreboard method, as seen 
for example, in "M88000" by Motorola, Inc., instructions that have been fetched to an 
instruction buffer are analyzed (decoded) by a register field decoder, and source register 
numbers and destination register numbers of the instructions obtained in this analysis 
(decoding) processing, are provided to the register files. At the same time, these are 
provided to a register scoreboard circuit. The dependency between each of the multiple 
instructions is the analyzed by the register scoreboard circuit, and the execution order of 
the instructions is controlled in accordance with the analytical results. 
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Consideration is given to analyzing the possibility of the simultaneous execution of the 
instructions according to the dependency between each of the multiple instructions to be 
processed in parallel, using multiple instruction execution units, by utilizing the register 
scoreboard method. For example, sequential multiple instructions are fetched to the 
instruction buffer; and the types of these instructions are determined by an instruction 
distributor, respectively. Execution units suitable for executing each of the instructions are 
selected in accordance with the determined results, respectively, and each of the 
instructions is distributed & supplied [to the selected execution unit], respectively; and 
executed in parallel. 

Specifically, various instruction execution units, such as a floating point multiplication unit, 
a floating point addition unit and an arithmetic logical unit (ALU), are prepared, and 
instructions suitable for the instruction execution functions are distributed & supplied to 
the multiple instruction execution units, respectively, with each of the instructions being 
executed in parallel. 

In this case, the dependency between each of the instructions is analyzed using the register 
field decoder and the register scoreboard circuit respectively incorporated in each of the 
instruction execution units. 

Since the dependency between each o£lhe instructions is analyzed using the register 
scoreboard circuit in each of the instruction execution units after the instructions are 
distributed & supplied to the instruction execution units via the instruction distributor, 
respectively, there is the problem that it takes a long time to process. In other words, there 
is a problem that a long time is required for testing the possibility of the simultaneous 
execution of multiple instructions. 

(Problems Overcome by the Invention) 

As described above, in the case of processing conventional general sequential instruction 
codes in parallel while maintaining the compatibility of their object, it is necessary to test 
the execution order dependency between each of multiple instructions, and to control the 
simultaneous execution of the multiple instructions. However, if the dependency between 
each of the instructions is to be analyzed using the conventional register scoreboard 
method as it is, after instructions are distributed & supplied to the instruction execution 
units, the dependency between each of the instructions is analyzed by each of the 
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instruction execution units, respectively, with the problem such that it takes a long time to 
process. 

The present invention gives due regard to these circumstances, and has the objective of 
providing an electronic computer where the execution order dependency between each of 
multiple instructions is detected at high speed, thereby enabling efficient parallel 
processing of multiple instructions. 

[Construction of the Invention] 

(Problem Resolution Means) 

The present invention relates to an electronic computer provided with a fetch circuit that 
simultaneously fetches multiple instructions, and an instruction distributor that distributes 
& supplies instructions fetched by this fetch circuit, to multiple execution units, 
respectively, and that allows the execution units to execute the instructions in parallel. 
It is characterized by the fact that in parallel to the distribution & supply processing of the 
[multiple] instructions by the instruction distributor, it is provided with: 

a 1 st processing unit that decodes the multiple instructions, which have been fetched by the 
fetch circuit, and that detects the execution order dependency between each of the multiple 
instructions; and 

a 2 nd processing unit that controls aborting the instructions supplied to the execution units, 
in accordance with the execution order dependency, detected between each of the multiple 
instructions by this 1 st processing unit; and allows the multiple execution units to execute 
the instructions in simultaneously executable order, respectively. 

(Operation) 

According to the present invention, multiple instructions fetched by the fetch circuit are 
distributed & supplied to the multiple execution units according to the type of the 
instructions. Parallel to the processing of the decoding and execution of the instructions at 
each of the execution units, multiple instructions fetched by the fetch circuit, are directly 
decoded by the 1 st processing unit, and processing to detect the execution order 
dependency between each of the multiple instructions is simultaneously executed, the 
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execution of the instructions by the multiple execution units being controlled in accordance 
with this detection results. Therefore, for example, in the case of detecting instruction(s) 
where the possibility of simultaneous execution is denied according the detection 
processing of the execution order dependency, it is possible to promptly control aborting 
the execution of the instruction(s) by the execution unit(s) where the instruction(s) has 
been distributed & supplied, respectively. In other words, without having a temporal delay 
because of detecting the dependency between each of the instructions at an individual 
execution unit where the instruction(s) has been distributed & supplied, a simultaneously 
unexecutable instruction(s) is promptly detected, and the execution of the instruction(s) 
can be efficiently aborted. 

(Embodiment) 

An electronic computer relating to one embodiment of the present invention is explained 
hereafter, with reference to the drawings. 

Fig. 1 shows a schematic construction of the primary components of the electronic 
computer relating to the embodiment, wherein the symbol ' 1' is an instruction buffer (fetch 
circuit) that fetches sequential instructions, multiple instructions at a time. Here, the 
instruction buffer 1 constructed to simultaneously fetch four instructions, II, 12, D and 14, 
is shown. 

The four instructions, II , 12, 13 and 14 fetched by the instruction buffer 1 , are distributed & 
supplied to multiple instruction execution units 3 (3a, 3b, ... 3n) according to the types of 
instructions, via an instruction distributor 2, respectively. In this embodiment, as the 
multiple instruction execution units 3, a floating point addition unit (ADD) 3a, a floating 
point multiplication unit (MUL) 3b, and arithmetic logical units (ALU) 3c & 3d are 
prepared. In the case that an instruction fetched by the buffer 1, is a floating point addition 
instruction, the instruction is supplied to the floating point addition unit (ADD) 3a by the 
instruction distributor 2, and in the case that an instruction is a floating point multiplication 
instruction, the instruction is supplied to the floating point multiplication unit (MUL) 3b. 
In addition, in the case that an instruction is an integral number operation instruction, the 
instruction is distributed & supplied to the arithmetic logical units (ALU) 3 c and 3d, 
respectively. 

Each of these instruction execution units 3 is provided with a decoder 4, respectively, and 
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basically, instructions that have been selectively distributed & supplied from the 
instruction buffer 1 are respectively decoded under the control of the instruction distributor 
2, and a register is accessed in accordance with the data in the register field indicated in the 
decoding result, and the instructions are executed. 

In the meantime, the four instructions, II , 12, 13 and 14 fetched by the instruction buffer 1 
are supplied to the instruction distributor 2, and are distributed & supplied to the instruction 
execution units 3, and are simultaneously provided to a simultaneous execution possibility 
detector (l 8t processing unit) 5. The simultaneous execution possibility detector 5 is 
provided with: 

a decoder 5a, which decodes instructions provided from the instruction buffer 1, 
respectively, and 

a detector 5b, which tests the decoding result and determines the possibility of the 
simultaneous execution between each of the multiple instructions. The possibility of the 
simultaneous execution between each of the four instructions, II, 12, D and 14 fetched by 
the instruction buffer 1 is determined using the decoder 5a and the detector 5b. 

Specifically, for the determination of the possibility of simultaneously executing the 
multiple instructions, for example, in the case that the instructions II, 12, ... are provided, 
then: 

11 [ADD r3, r4, r5] 

12 [ADD r6, r7, r8] 

For example, instruction II is regarded as a standard and the execution condition of the 
next instruction 12 is determined. In this case, since all of the register fields of the 
instruction 12 are different from the destination register filed r3 of the instruction II , it is 
determined that the instruction 12 is simultaneously executable along with the instruction 
II . Determination processing is performed by sequentially determining whether the 
execution of the next instruction can be realized relative to a predecessor instruction. In 
other words, the determination of the possibility of simultaneous execution is similarly 
performed to the next instruction 13 relative to the instructions II and 12, respectively. For 
the next instruction 14, the determination is similarly performed relative to the instructions 
II, 12 and 13, respectively. 
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In the case of instructions II , 12: 



11 [ADD 

12 [ADD 



r3, 
r6, 



r4, 
r7, 



r5] 
r3] 



As shown in Fig. 2, if the instruction II is regarded as a standard and the execution 
condition of the next instruction 12 is determined, the source register field r3 in the 
instruction 12 is detected as being consistent with the destination register field r3 in the 
instruction II . As a result, the simultaneous execution possibility detector 5 determines 
that the instruction 12 needs to be executed after the execution of the instruction II is 
completed. In other words, it is impossible to simultaneously execute instruction 12 and 
instruction II. 

In order to abort the execution of instructions which have been determined as being 
simultaneously unexecutable by the simultaneous execution possibility detector 5, an abort 
signal supplier (2 nd processing unit) 6 issues an abort signal to the instruction execution 
unit 3 to which the instruction has been supplied. The abort signal is issued to the 
instruction [sic] where the simultaneous execution possibility has been denied, and the 
determination about to which instruction execution unit the abort signal is provided is 
made as a result of a search about which instruction execution unit the instruction will be 
distributed (or has been distributed) according to the type of the instruction, similar to the 
instruction distributor 2. When the abort signal is provided to the instruction execution unit 
3, as described above, the execution of the instructions are aborted which have been 
provided via the instruction distributor 2. 

In the case that the execution of one instruction has been aborted, the fetching of new 
instructions (multiple instructions) by the instruction buffer 1 is terminated. In the next 
cycle, re-execution of the aborted instruction is performed. In this case, it is unnecessary to 
re-execute other instructions where execution has already commenced, in which case it is 
also unnecessary to check the possibility of simultaneous execution between instructions 
requiring execution, and other instructions where execution has already been completed. 
Therefore, whether the execution of only the instruction which caused the denial of the 
possibility of the simultaneous execution has been completed is checked, and the 
instruction is re-executed. Re-execution processing of the instruction enables the 
execution of all instructions fetched by the instruction buffer 1, and after one-cycle the 
instruction buffer 1 fetches new instructions (multiple instructions). Parallel execution of 
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these instructions by the multiple instruction execution units 3 is similarly controlled. 

An electronic computer constructed as described above, for example, where fetching of 
instructions (instruction access), decoding of the instructions (including the detection of 
the possibility of the simultaneous execution) and the execution of the instructions are 
divided into stages, and pipeline processing is performed, making it possible to determine 
the possibility of simultaneous execution between each of the multiple instructions in the 
decoding stage. In other words, the processing of the determination for the possibility of 
simultaneous execution can be performed within the processing time for the decoding 
stage, without re-setting the processing time required for the determination of the 
possibility of simultaneous execution between each of the instructions. Therefore, even in 
the case of executing multiple instructions in parallel, it is unnecessary to establish the 
processing stage only for this purpose, making it possible to efficiently perform the 
processing of multiple instructions in parallel at high speed. 

In other words, according to the present invention, separating the register field included in 
each instruction execution unit from the decoder for the purpose of accessing the register, a 
register field decoder for the purpose of determining the possibility of the simultaneous 
execution between each of multiple instructions is established, and the possibility of the 
simultaneous execution between each of the instructions is determined, and the 
simultaneous execution of the instructions by the multiple instruction execution units is 
controlled using the register field decoder, independent from the instruction distributor 2. 
Therefore determination of the possibility of simultaneous execution between each of 
instructions can be performed at high speed, the cycle time and the processing execution 
time can be reduced, and the computer's performance can be drastically improved. In 
addition, while the instruction distributor 2 distributes multiple instructions to the multiple 
instruction execution units, simultaneous execution of the instructions can be controlled in 
accordance with the determination results relating to the possibility of simultaneous 
execution. Hence, it is greatly effective in practical use; for example, and while 
maintaining the compatibility between the conventional sequential instruction codes and 
their objects, it enables the efficient parallel processing of the instructions. 

However, the present invention is not limited to this embodiment. For example, an 
embodiment has been explained using an instance where four instructions are processed in 
parallel. However, this could be any number. However, it is necessary to test the 
possibility of the simultaneous execution (instruction execution order) between each of the 
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instructions, respectively. Further, the number of the instruction execution units and their 
type can be established according to the specifications of the computer. Other than that, the 
present invention can be implemented with various modifications within the scope of the 
outline. 

[Efficacy of the Invention] 

As described above, according to the present invention, while a 'binary compatibility' 
(compatibility [sic.]) between the sequential instruction codes and their object is 
maintained, it is possible to perform parallel processing of multiple instructions, and is 
greatly effective for the practical use. For example, the reduction of the processing 
execution time enables the efficient execution of instructions. 
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4. Brief Description of Drawings: 

Fig. 1 is a schematic block diagram of primary components in the electronic computer 
relating to one embodiment of the present invention, and 

Fig. 2 is diagram that shows a basic concept of the determination processing of the 
simultaneous execution possibility to multiple instructions as a pattern. 

1 ... instruction buffer (fetch circuit), 2 ... instruction distributor, 3 ... instruction 
execution unit, 4 ... decoder, 5 ... simultaneous execution possibility detector (1 st 
processing unit), 5a ... decoder, 5b ... detector, 6 ... abort signal supplier (2dn processing 
unit) 



Agent of patent applicant: Patent agent: Takehiko SUZUE 
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(1) The descriptions of "Scope of Patent Claims" are corrected as in the separate paper. 

(2) The description from the 7 th line of the 6 th page to the 2 nd line of the 7 th page in the 
Specification is corrected as follows: 

Record 

The electronic computer relating to the present invention is characterized by the fact that it 
is provided with: 

a fetch means that fetches multiple instructions; 

an instruction supply means that performs the processing to supply the multiple 
instructions fetched by the fetch means, to multiple execution means; and 
a 1 st processing means that detects the dependency of the execution order between each of 
the multiple instructions fetched by the fetch means, parallel to the processing to supply 
[the multiple instructions] by the instruction supply means. 

Preferably, it can be additionally provided with a 2 nd processing means that controls 
aborting the instructions supplied to the execution means, in accordance with the 
dependency of the execution order detected between each of the multiple instructions by 
the 1 st processing means. 

(3) From the 14 th lines of the 7 th page to the 13 th line of the 7 th page [sic] in the 
Specification, the description, "fetched by the fetch circuit ... the execution of the 
instructions by the multiple execution units is controlled", is corrected to, "parallel to 
the processing to supply multiple instructions fetched by the fetch circuit, to multiple 
execution means according to the types of the instructions, respectively, the processing 
to detect the dependency of the execution order between each of the multiple 
instructions fetched by the fetched means, is simultaneously executed by the 1 st 
processing means, and the instruction execution by the multiple execution means is 
controlled in accordance with the detection results " 

(4) The term, "execution unit" in the 16 th line of the 7 th page [sic] in the Specification is 
corrected to "execution means". 

(5) The term, "execution unit" in the 18 th line of the 7 th page [sic] in the Specification is 
corrected to "execution means". 
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2. Scope of Patent Claims: 
Claim 1 

An electronic computer, wherein, it is provided with 

a fetch means that simultaneously fetches multiple instructions, 
an instruction supply means that performs the processing to supply the multiple 
instructions, which have been fetched by the fetc h means, to multiple execution means. 
respectively, and 

a l 8t processing means that detects the execution order dependency between each of the 
multiple instructions fetched bv the fetch means , and accomplishes parallel processing to 
supply [the instructions] by the instruction supply means . 

Claim 2 

The electronic computer according to Claim 1, wherein, it is additionally provided with a 
2 nd processing means that controls aborting the instructions supplied to the execution 
means , in accordance with the execution order dependency detected between [each of] the 
multiple instructions bv the 1 st processing means . 
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